🐿️ ScourBrowse
LoginSign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
👁️ Perceptual Coding

Psychoacoustic Models, Lossy Compression, Human Perception, Audio Quality

JSQA: Speech Quality Assessment with Perceptually-Inspired Contrastive Pretraining Based on JND Audio Pairs
arxiv.org·6h
🎧Learned Audio
CompressedVQA-HDR: Generalized Full-reference and No-reference Quality Assessment Models for Compressed High Dynamic Range Videos
arxiv.org·6h
🎬AV1 Encoding
2025-07-16: Understanding Hallucination in Large Language Models: Challenges and Opportunities
ws-dl.blogspot.com·10h·
Discuss: ws-dl.blogspot.com
✨Effect Handlers
Investigating claims that GPUs can unlock "limitless music production potential"
musicradar.com·18h·
Discuss: Hacker News
🎧Learned Audio
Deep Neural Encoder-Decoder Model to Relate fMRI Brain Activity with Naturalistic Stimuli
arxiv.org·6h
🧠Neural Codecs
Measuring and predicting visual fidelity
arxiv.org·6h
🌈Color Science
Got buyer's remorse with your 8GB graphics card? Nvidia's AI texture compression promises huge benefits for GPUs with stingy amounts of memory
techradar.com·15h
🖥️Terminal Renaissance
Intel releases new tool to measure gaming image quality in real time —AI tool measures impact of upscalers, frame gen, others; Computer Graphics Video Quality M...
tomshardware.com·22h
📊Rate-Distortion Theory
Boffins detail new algorithms to losslessly boost AI perf by up to 2.8x
theregister.com·49m
💻Local LLMs
Machine Learning Fundamentals: dimensionality reduction
dev.to·19h·
Discuss: DEV
📐Linear Algebra
A Neural Net For a Graphing Calculator?
hackaday.com·2h
🤖Advanced OCR
A Multimodal Data Fusion Generative Adversarial Network for Real Time Underwater Sound Speed Field Construction
arxiv.org·6h
🎧Vorbis Encoding
Mitigating Object Hallucinations via Sentence-Level Early Intervention
arxiv.org·6h
👂Psychoacoustic Coding
Can AI really code? Study maps the roadblocks to autonomous software engineering
news.mit.edu·13h
📏Code Metrics
A Minimal DDPM
github.com·1d·
Discuss: Hacker News
🧠Machine Learning
Large Language Models and Non-Negative Matrix Factorization for Bioacoustic Signal Decomposition
arxiv.org·2d
👂Psychoacoustic Coding
The Man Behind the Sound: Demystifying Audio Private Attribute Profiling via Multimodal Large Language Model Agents
arxiv.org·2d
🎵Audio ML
Evaluating Image Compression Tools
rachelplusplus.me.uk·4d·
Discuss: Hacker News
📊Rate-Distortion Theory
COLI: A Hierarchical Efficient Compressor for Large Images
arxiv.org·1d
🧠Neural Compression
[R][D] Interpretability as a Side Effect? Are Activation Functions Biasing Your Models?
reddit.com·1d·
Discuss: r/MachineLearning
📊Learned Metrics
Loading...Loading more...
AboutBlogChangelogRoadmap